NSF PAR Search | NSF Public Access Repository


Theoretical Analysis of Weak-to-Strong Generalization

Lang, Hunter; Sontag, David; Vijayaraghavan, Aravindan (December 2024, The Thirty-eighth Annual Conference on Neural Information Processing Systems (NeurIPS) 2024)

Full Text Available
Learning to Decode Collaboratively with Multiple Language Models

Shen, Shannon Zejiang; Lang, Hunter; Wang, Bailin; Kim, Yoon; Sontag, David (August 2024, Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (ACL))

We propose a method to teach multiple large language models (LLMs) to collaborate by interleaving their generations at the token level. We model the decision of which LLM generates the next token as a latent variable. By optimizing the marginal likelihood of a training set under our latent variable model, the base LLM automatically learns when to generate itself and when to call on one of the “assistant” language models to generate, all without direct supervision. Token-level collaboration during decoding allows for a fusion of each model’s expertise in a manner tailored to the specific task at hand. Our collaborative decoding is especially useful in cross-domain settings where a generalist base LLM learns to invoke domain expert models. On instruction-following, domain-specific QA, and reasoning tasks, we show that the performance of the joint system exceeds that of the individual models. Through qualitative analysis of the learned latent decisions, we show that models trained with our method exhibit several interesting collaboration patterns, e.g., template-filling.
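The latent-variable objective described in the abstract can be illustrated with a small sketch. This is not the authors' implementation. It assumes a single assistant model, a per-token gate probability that the assistant generates the token, and precomputed per-token probabilities from each model. The function names, the two-model restriction, and the 0.5 threshold are hypothetical choices made for illustration only.

```python
import math


def marginal_log_likelihood(gate_probs, base_token_probs, assistant_token_probs):
    """Log-likelihood of a token sequence with the generator choice marginalized out.

    Assumed setup (not taken from the paper): for each position t,
      gate_probs[t]            = probability that the assistant generates token t
      base_token_probs[t]      = base model's probability of the observed token
      assistant_token_probs[t] = assistant's probability of the observed token
    """
    total = 0.0
    for g, p_base, p_asst in zip(gate_probs, base_token_probs, assistant_token_probs):
        # Sum over the two values of the latent "who generates" variable.
        total += math.log((1.0 - g) * p_base + g * p_asst)
    return total


def choose_generators(gate_probs, threshold=0.5):
    """Hypothetical decoding rule: defer to the assistant when the gate exceeds a threshold."""
    return ["assistant" if g > threshold else "base" for g in gate_probs]


if __name__ == "__main__":
    gates = [0.1, 0.8, 0.3]
    base = [0.6, 0.05, 0.4]
    asst = [0.2, 0.7, 0.1]
    print(marginal_log_likelihood(gates, base, asst))
    print(choose_generators(gates))
```

Maximizing this quantity with respect to the gate needs no labels saying which model should produce each token, which matches the abstract's "without direct supervision" description at a high level.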
Full Text Available
Graph cuts always find a global optimum for Potts models (with a catch)

Lang, Hunter; Sontag, David; Vijayaraghavan, Aravindan (January 2021, Proceedings of the Thirty-eighth International Conference on Machine Learning (ICML))
Full Text Available
Beyond Perturbation Stability: LP Recovery Guarantees for MAP Inference on Noisy Stable Instances

Lang, Hunter; Reddy, Aravind; Sontag, David; Vijayaraghavan, Aravindan (January 2021, Proceedings of The 24th International Conference on Artificial Intelligence and Statistics, PMLR)
Full Text Available
Block Stability for MAP inference

Lang, Hunter; Sontag, David; Vijayaraghavan, Aravindan (January 2019, Proceedings of Machine Learning Research)

Full Text Available
Optimality of Approximate Inference Algorithms on Stable Instances

Lang, Hunter; Sontag, David; Vijayaraghavan, Aravindan (January 2018, Proceedings of Machine Learning Research)

Full Text Available
